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Abstract — Practical random network coding based scliemes 
for multicast include a header in each packet that records 
the transformation between the sources and the terminal. The 
header introduces an overhead that can be significant in certain 
scenarios. In previous work, parity check matrices of error 
control codes along with error decoding were used to reduce 
this overhead. In this work we propose novel packet formats that 
allow us to use erasure decoding and list decoding. Both schemes 
have a smaller overhead compared to the error decoding based 
scheme, when the number of sources combined in a packet is 
not too small. 

Index Terms — network coding, network coding overhead, era- 
sure decoding, list decoding. 



I. Introduction 

In a multicast scenario, network coding can achieve 
maximum-flow-min-cut capacity. It is shown in HI, 13 that if 
each intermediate node transmits random linear combinations 
of the incoming packets over a large field, the terminal can 
recover the source packets with high probability. Under such 
a distributed randomized scheme, the terminals need to know 
the transfer matrix. In ||3| it was shown that this can be carried 
in the headers of the packets. The header records the network 
coding vector, which consists of the linear combination coeffi- 
cients for the packet. The header length equals to the number 
of source packets, which is negligible when the packet length 
is large and the number of sources is relatively small. 

There are situations in which the packet overhead can be 
significant. As noted in 14), in sensor networks, the number of 
sources is large and current sensor technology does not allow 
transmission and reception of very large packets. However, in 
many of these applications, the network topology is such that 
the received packets at a terminal only consist of combinations 
of a small or moderate number of sources. In addition, the 
random network coding protocol can possibly be appropriately 
modified to enforce the constraint that a received packet 
contains combinations of only a few sources. This implies 
that it may be possible to "compress" the header size and 
reduce the overhead. The idea of compressing coding vectors 
was first proposed in ID, where a strategy using parity-check 
matrices of error control codes was used. Under that scheme, 
the overhead of each packet has length 27n if the maximum 
number of packets being combined in the packet is m. 

Suppose the total number of sources is n. As mentioned 
in m, the restriction on the number of combined packets 

The authors are with the Department of Electrical and Computer En- 
gineering, Iowa State University, Ames, Iowa 50011, USA. Email: {szli, 
adity ar} @ iastate . edu 

This research was supported in part by NSF grant CNS-0721453. 



introduces n — m zeros in each row of the transfer matrix, 
which may affect the invertibility of the matrix. The network 
topology in general will make the distribution of zeros non- 
uniform and this makes the chance of losing rank becomes 
larger. Therefore, the value of m can not be too small. 
Main Contributions - In this work, we propose improved 
schemes for the compression of network coding vectors. 
1) In the first scheme, we add an ID segment to the header 
that records the IDs of the sources being combined in the 
packet. This requires modifying the intermediate node oper- 
ation slightly but gives two main advantages: a) It allows us 
to convert the problem at the terminal into one of decoding 
erasures (as against decoding errors). The required header 
length becomes m + n/ \ogq (the base of the logarithm is two 
throughout the paper), where q is the field size. It is less than 
the overhead of the error decoding based scheme (2m) when 
m is not too small, b) The protocol suggested in [4J to limit 
the number of sources combined in a packet adds a counter to 
each packet for tracking the number of sources that have been 
combined. However, when combining two incoming packets, it 
is hard for the intermediate node to know the number of source 
packets that will be combined in the new packet because the 
sets of source packets in the incoming packets may overlap. 
It can only obtain an inaccurate upper bound by adding two 
counters together. Using our proposed ID segment, the number 
of source packets being combined in every coded packet can be 
accurately traced. 2) In the second scheme, we propose a list- 
decoding based compression scheme (based on error decoding 
like [:4J) , whose overhead can be made arbitrarily close to 
TO + 0(logn)/logg. In this scheme the intermediate nodes 
remain oblivious to the fact the network coding vectors are 
compressed. The lower overhead for this scheme comes at the 
expense of higher decoding complexity (for the header) at the 
terminal. 

II. Background and Related Work 

Let Fq denote a finite field with size q, where q is a power of 
two. Consider a network with n sources, not necessarily col- 
located. The i"* source transmits a length- packet pi G i^^^. 
The packet contains two parts: = [pP|pf^], where £ 
F!^ is the header and pf^ e F^~'^ is the actual message. 



packet received by a terminal is = [rf^jrf^], where 



The i*'' 

denotes the header and rf^ denotes the coded message. 
In 131, the header, p^ is designed to be the i*^ row of an 
n-by-n identity matrix. Thus, under random network coding, 
rf contains the overall transformation from the sources to the 
terminal for the coded message rf. The length of the header 
h ^ n. Denote the vector of transformation coefficients by q^. 



In general, the entries of could be all non-zero since 
all sources could be combined. Under the assumption that at 
most m sources are combined, contains at most m non- 
zero entries, which leads us to an error control coding based 
compression ||4)- Let _ff be a parity check matrix of a (n, k, d) 
linear block code, where d is the minimum distance ||5]. In a 
channel coding setting, a codeword x such that kH^ = is 
transmitted, and y = x + e is received, where e denotes the 
error The decoder computes the syndrome (of length n — k) 
yH^ = eH^ = s and finds the error pattern e. As long as 
the actual Hamming weight of e, wt{e) < [{d— 1)/2J, e can 
be recovered exactly. This can be done efficiently for codes 
such as RS and BCH using the Berlekamp-Massey algorithm 
(BMA) 0. Equivalently, we can reconstruct e (of length-n) 
from s (of length n~k) and this can be viewed as a method to 
compress a vector e. For an error pattern such that wt{e) < m, 
to get a high compression rate, we want k to be as large as 
possible while the minimum distance is d and the code length 
is n. From the Singleton bound IS), k<n — d+l = n— 2m 
and the well known RS codes achieve this with equality.. 

In the error-correction based compression scheme 14J, the 
header of the packet injected in the network is chosen to 
be = UH^. After random linear coding, the i*'' received 
packet contains the header rf = (liH^. Note that the network 
coding vector q^ is a length-n vector with wt{qi) < m and 
is available at the terminal. Thus, the problem of recovering 
qi is equivalent to error correction. Then the n headers can be 
stacked row by row, forming the n-by-n transfer matrix. The 
overhead is h ~ n — k and the maximum number of sources 
allowed to be combined in one packet is m < [h/2\. 

III. Erasure decoding based compression scheme 

In channel coding, an erasure is defined to be an error whose 
location is known by the decoder For a linear block code with 
minimum distance d, it can correct up to d — 1 erasures. For 
BCH codes and RS codes, syndrome-based decoding and the 
BMA still work after some minor modifications iSj. In the 
network coding vector compression scenario, if we know the 
locations of non-zero elements in q^, we can allow m to be 
as large as d — 1 < n — k. Note that as long as we know 
which source packets are combined in the packet of interest, 
we know the locations of the non-zero elements. 
Proposed Solution. - We add a bit array of length-?i to the 
header pf^ and call it ID segment. At the j*'' source, only the 
j*'' position is set to 1 and others are 0. At every intermediate 
node, when several incoming packets are combined to form a 
packet for an outgoing edge, the ID segment of the outgoing 
packet is the bit-wise OR of the ID segments of the incoming 
packets, pf also includes iiH^ (of length n — k) as before. 
This protocol is very easy to implement and every packet in the 
network knows exactly which source packets are combined in 
it. The element of q^ is non-zero if and only if the j*'' bit in 
the ID segment of is 1 . As pointed out in the introduction, 
if we want to limit the number of source packets being com- 
bined by network protocol, this information is important for 
the intermediate nodes. The terminal receives the "syndrome" 
(\iH^ and knows the locations of the "errors". By erasure 
decoding, it can recover q^ as long as wt(qi) < m = n — fc. 



The length of the ID segment in terms of symbols is 
n/\ogq. The total overhead is n — k + n/ \ogq. If m is fixed, 
the overhead for the scheme in ||4l is 2m and the overhead 
for our erasure decoding scheme is m + n/\ogq. Thus, if m 
is not too small, our proposed scheme has less overhead. 

Example 1. Suppose n — 50, q = 2*,m = 15. Under 
error decoding scheme, a (50, 20) RS code is required and 
the overhead is 30 bytes. Under erasure decoding scheme, a 
(50, 35) RS code is required and the overhead is 22 bytes, a 
saving of 26%. According to the current ZigBee standard iH, 
the packet size is 128 bytes. 

Example 2. Suppose n — 255, q — 2^,m ~ 150. No code 
has minimum distance 301 with code length 255. Under error 
decoding the network coding vector cannot be compressed and 
the overhead h ~ n — 255. Under erasure decoding scheme, 
a (255, 105) RS code can be used and h = 182. 

A reviewer has pointed out that if one uses a bit-array to 
record the IDs of the sources, then there is an alternative 
scheme that does not require decoding at the terminals. Basi- 
cally, every node keeps track of the coefficients and the ID's 
and combines them so that the net transformation is available 
at the terminals without decoding. However, such a scheme 
requires the intermediate nodes to scan the headers of the 
incoming packets to locate the corresponding coefficients that 
need to be combined (in addition to performing a bitwise 
OR in the ID array). This solution increases the processing 
complexity at the intermediate nodes. Our proposed approach 
can be viewed as an alternate solution to this problem. The 
correct choice would depend upon the capabilities of the 
sensor nodes and the application requirements. 

IV. List decoding based compression scheme 

In this section, we show that the overhead of the strategy 
based on error decoding (such as I?]) can be reduced by 
using list decoding at the terminal. It does not require the 
decoder to know the error locations so we need not add 
the ID segment in the header Furthermore, the intermediate 
nodes simply perform linear combination on the header, i.e., 
it is oblivious to the fact the network coding vectors are 
compressed. In the channel coding scenario, given the received 
word y = x + e, the decoder tries to find a codeword x 
within Hamming distance = [{d— 1)/2J of y. As long as 
wt{e) < to, the decoder will find a unique x and the decoding 
is successful. When wt{e) > io, there is no guarantee that the 
decoder will succeed. This is the scenario in which the notion 
of list decoding is useful. The list decoding problem can be 
stated as follows. 

Problem 1. Given a received word y = x + e, find the list 
of all codewords x'i within Hamming distance t > to of y. 

As long as wt{e) < t, the actual codeword x will appear in 
the list. The list decoding problem has been solved to some 
extent (see ||71 for a survey). Efficient list decoding algorithms 
with polynomial sized lists for RS codes up to a radius of 
(t = ?i — ^/nk) are known. The class of folded RS codes Q 
can be decoded arbitrarily close to the Singleton bound, i.e., 
t can be close to n — fc, though this is possible only with 
very large alphabets. In order to apply list decoding to our 
problem, we propose a packet header for the i*^ source packet 
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that consists of UH^ and some side information. Note that at 
the terminal, we obtain the syndrome s = eH^ = (\iH^ of 
network coding vector .Therefore, the problem can be stated 
as follows. 

Problem 2. Find the list of all possible error pattern e 's 
such that eH — s and wt(e) < t, where t > to- 

We present a problem transformation such that all list 
decoding algorithms for problem 1 can be used to solve 
problem 2. Given s = eH^ , we can find an arbitrary y such 
that s = yH^, then use this y as input to problem 1 and get 
the list of x's as an output, then e = x + y form the list of 
e's. Such y can be chosen easily. Recall that the parity check 
matrix iJ of a (n, k) code has rank {n — k) and there exist 
(n — k) columns in H that are linearly independent. Let the 
elements of y that correspond to these columns be unknowns 
and other k elements be zero. Note if a RS code is used, 
we can choose any k elements in y to be zero. The system of 
equations s = yi?"^ has [n — k) unknowns and {n~k) linearly 
independent equations, from which y can be determined. Next, 
we prove that the above transformation solves problem 2 
correctly. Suppose the resultant list of problem 2 is a set 
Li and the list obtained by using our transformation is a 
set L2- We need to show Li = L2- First, if e G L2, since 
e = X + y and x and y differ at most t positions, wt{e) < t 
and eH^ = xiJ^ + yi7^ = + yH'^ = s, then e e Li. 
Second, if e e Li, there exists an x = y + e such that 
xH'^ = yiJ^ + eH'^ = and since wt{e) < t, A(x, y) < t 
(A(-) denotes Hamming distance), this means x is a codeword 
within Hamming distance t of y, then x is on the list of the 
output of problem 1. Thus e G L2- 

Note that so far we have only found a list of possible error 
patterns. In practice we need to find the unique error pattern 
as the decoded network coding vector. The small amount of 
side information included in the header is useful here. The side 
information generation problem was solved in |j8] Theorem 2]. 
It is a hash function based algorithm to select a message in 
a candidate set and works no matter we are facing problem 
1 or problem 2. Note that in our compression problem, the 
message space is all possible network coding vectors and the 
size is g". The side information at the terminal should contain 
im Lemma 1] (i) • g^, where is the actual "message" 
(network coding vector), is a randomly chosen column of 
the generator matrix of a low rate RS code (which is different 
from the one used to generate the syndrome) and • denotes 
inner product, and (ii) the random number r. Denote the list 
of candidates to be {q^, . . . , qf }. The terminal knows the RS 
code a priori and computes q^ g^ for every j and finds j* such 
that q^* • gr = qi ■ gr ■ Since the actual q; is in the list, such 
a j* exists. It was shown in [8] Theorem 2] that as long as 
0(log n) + 0(log L) + 0{\og{l/Pf)) bits of side information 
are provided, the probability that j* is not unique is less than 
Pf. The basic idea behind this is that for two codewords of 
a RS code with very large minimum distance, the probability 
that the symbols at a random chosen position r are equal is 
very small. The list size L is polynomial with n. Thus, the 
amount of side information needed is O(logn) and Pf is the 
probability of failure to find a unique output. In order to obtain 
the side information at the terminal, we include • g^ in the 



header of the i source packets and the intermediate nodes 
perform linear combination on it, so that the terminal receives 
qi • gr- We can let the session ID to be the random number r 
and available to the sources and terminals so that r does not 
need to be transmitted over the network. 

The list decoding based scheme incurs an overhead of 
length m + 0{\ogn) / \ogq and allow the number of source 
packets being combined to be rn. It has smaller overhead size 
than erasure decoding based scheme. However, as mentioned 
before, in order to approach the list decoding capacity, the field 
size needs to be large and the decoding algorithm becomes 
more complicated. If we use ordinary RS codes and the 
efficient decoding algorithms that corrects up to n—\fnk errors 
to compress network coding vector, the overhead length will 
be 2m — ni} jn + 0(logn)/log(7. Usually this will be less 
than the overhead of error decoding based scheme but greater 
than erasure decoding based scheme. 

Example 3. Suppose n = 255, q = 2^,m = 86. We use a 
(255, 112) RS code. The syndrome length is 143 and the side 
information length is [30/8] for Pf = 0.0001, so ft. = 147. h 
equals 172 or 118 for error or erasure decoding respectively. 

V. CONCLUSION 

We proposed erasure decoding based and list decoding 
based approaches to improve the compression of network 
coding vectors. Table |I] compares the overheads of the various 
schemes. For moderate or large value of m, that may be 
necessary to support the multicast rate, both schemes have less 
overhead than the error decoding based scheme. Our investi- 
gation reveals that the list decoding based scheme has a lower 
overhead with respect to the erasure coding based scheme, 
when capacity achieving codes are used. However, from a 
practical perspective, the erasure coding scheme offers the best 
tradeoff between overhead and implementation complexity. 



TABLE I 

Comparison of three schemes for the same m. 





Header format 


Header length 


EiTor 


Syndrome 


2m 


Erasure 


Syndrome 
+ ID segment 


m + "/ log q 


List 


Syndrome 
+ side information 


m + 0(log n) 1 log q 
or 2m — rr? 1 n -\- (log n) / log q 
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